TAIL ASYMPTOTICS AND ESTIMATION FOR ELLIPTICAL DISTRIBUTIONS 
Abstract. Let $(X, Y)$ be a bivariate elliptical random vector with associated random radius in the Gumbel
max-domain of attraction. In this paper we obtain a second order asymptotic expansion of the joint survival probability
$P\{X > x, Y > y\}$ for $x, y$ large. Further, based on the asymptotic bounds we discuss some aspects of the statistical
modelling of joint survival probabilities and the survival conditional excess probability.



1. Introduction 

Let $(S_1, S_2)$ be a rotationally invariant (spherical) bivariate random vector with associated random radius
$R := \sqrt{S_1^2 + S_2^2}$. The basic distributional properties of spherical random vectors are obtained in Cambanis et al. (1981).
So if $R > 0$ almost surely, then we have the stochastic representation

$(S_1, S_2) \stackrel{d}{=} (R U_1, R U_2),$

where the bivariate random vector $(U_1, U_2)$ is independent of the associated random radius $R$ and uniformly distributed
on the unit circle of $\mathbb{R}^2$ ($\stackrel{d}{=}$ stands for equality of distribution functions). Linear combinations of spherical random
vectors define a larger class of random vectors, namely that of elliptical random vectors. Canonical examples are the 
Gaussian and Kotz distributions (see Fang et al. (1990), Kotz et al. (2000), or Reiss and Thomas (2007)). 
In this paper we consider a bivariate elliptical random vector defined in terms of $(S_1, S_2)$ and the pseudo-correlation
coefficient $\rho \in (-1, 1)$ via the stochastic representation

(1) $(X, Y) \stackrel{d}{=} (S_1, \rho S_1 + \sqrt{1 - \rho^2}\, S_2).$

Since for any two constants $a, b$ (see Lemma 6.2 of Berman (1982))

(2) $a S_1 + b S_2 \stackrel{d}{=} S_1 \sqrt{a^2 + b^2}$

we have $X \stackrel{d}{=} Y \stackrel{d}{=} S_1$. Referring to Cambanis et al. (1981) the random variable $S_1$ is symmetric about 0, and
furthermore $S_1^2 \stackrel{d}{=} R^2 W$, with $W$ a Beta random variable with parameters 1/2, 1/2 independent of $R$, implying that
the distribution function of $X$ and $Y$ is completely known if the distribution function $F$ of $R$ is specified.
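The Beta(1/2, 1/2) statement can be illustrated by simulation. The following is a minimal sketch, not part of the original argument: it draws the angle of $(U_1, U_2)$ uniformly on $(-\pi, \pi)$ and compares the empirical distribution function of $U_1^2$ with the arcsine law. The function names, sample size and seed are our own illustrative choices.

```python
import math
import random


def arcsine_cdf(t: float) -> float:
    """Distribution function of the Beta(1/2, 1/2) (arcsine) law on [0, 1]."""
    return (2.0 / math.pi) * math.asin(math.sqrt(t))


def empirical_cdf_u1_squared(t: float, n: int = 100_000, seed: int = 0) -> float:
    """Fraction of simulated U_1^2 <= t, with (U_1, U_2) uniform on the unit circle."""
    rng = random.Random(seed)  # fixed seed: an assumption made for reproducibility
    hits = 0
    for _ in range(n):
        theta = rng.uniform(-math.pi, math.pi)
        if math.cos(theta) ** 2 <= t:  # U_1 = cos(theta)
            hits += 1
    return hits / n


if __name__ == "__main__":
    for t in (0.1, 0.25, 0.5, 0.9):
        print(t, empirical_cdf_u1_squared(t), arcsine_cdf(t))
```

Since $S_1 = R U_1$ with $R$ independent of $U_1$, agreement of the two columns is what the relation $S_1^2 \stackrel{d}{=} R^2 W$ predicts.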

The basic distributional properties of elliptical random vectors are well-known, see e.g., Kotz (1975), Cambanis et 
al. (1981), Anderson and Fang (1990), Fang et al. (1990), Fang and Zhang (1990), Berman (1992), Gupta and Varga
(1993), Kano (1994), Szablowski (1998), or Kotz et al. (2000) among several others. 

The tail asymptotic behaviour of each component, say of X, can be determined under some assumptions on the tail 
asymptotics of R. The main work in this direction is done in Carnal (1970), Gale (1980), Eddy and Gale (1981), 
Berman (1982, 1983, 1992) among several others. For instance Berman (1992) shows the exact asymptotic behaviour
of $S_1$ if $R$ has distribution function in the Gumbel max-domain of attraction. With motivation from Berman's work,
Hashorva (2007) obtains an exact asymptotic expansion of the bivariate survival probability

$P\{X > x, Y > ax\}, \quad a \le 1$

letting $x$ tend to $\infty$. See also Asimit and Jones (2007) for a partial result.

The main impetus for the present article comes from the recent deep contribution Abdous et al. (2007). We derive 
in this paper a refinement of the asymptotic expansion of the joint survival probability obtained in Hashorva (2007). 
This is achieved assuming some second order asymptotic bounds on the tail asymptotics of the distribution function
F as suggested in Abdous et al. (2007). 



Our results are of a certain theoretical interest providing detailed asymptotic expansions for a classical problem of 
probability theory - tail asymptotics of random vectors. Further, based on our novel results, we suggest statistical 
estimators of the joint survival probability, conditional distribution and related quantile function. 

We choose the following order for the rest of the paper: In Section 2 we present the main results. Illustrating examples 
follow in Section 3. In Section 4 and Section 5 we discuss some statistical aspects concerning the estimation of the 
joint survival probability, conditional survival and conditional quantile functions. Proofs and related asymptotics are 
relegated to Section 6. 



2. Asymptotic Bounds 

Let $(X, Y)$ be a bivariate elliptical random vector as in (1), where the associated random radius $R$ has distribution
function $F$ with upper endpoint $x_F \in (0, \infty]$ and $F(0) = 0$. Hashorva (2007) derives an asymptotic expansion of
the tail probability $P\{X > x, Y > ax\}$, $a \in (-\infty, 1]$ for $x \uparrow x_F$ assuming that $F$ is in the Gumbel max-domain of
attraction, i.e.,

(3) $\lim_{u \uparrow x_F} \frac{1 - F(u + s/w(u))}{1 - F(u)} = \exp(-s), \quad \forall s \in \mathbb{R},$

with w some positive scaling function. Refer to Galambos (1987), Resnick (1987), Reiss (1989), Embrechts et al. 
(1997), Falk et al. (2004), Kotz and Nadarajah (2005), or de Haan and Ferreira (2006) for details on the Gumbel 
max-domain of attraction. 

It is well-known (see e.g., Resnick (1987)) that (3) is equivalent with the fact that the distribution function $F$ has the
following representation

(4) $1 - F(x) = d(x)[1 - F^*(x)], \quad x < x_F,$

with $d(x)$ a positive function converging to $d \in (0, \infty)$ as $x \uparrow x_F$ and

(5) $F^*(x) = 1 - \exp\Bigl(-\int_{z_0}^{x} w(s)\, ds\Bigr), \quad x < x_F,$

where $z_0$ is a finite constant in the left neighbourhood of the upper endpoint $x_F$. The distribution $F^*$ is referred to
in the sequel as the von Mises distribution related to $F$, whereas $w$ as the von Mises scaling function of $F$.
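For illustration, (3) and (5) can be checked numerically for one concrete scaling function. The sketch below assumes $w(s) = c\delta s^{\delta - 1}$ with $c = 2$, $\delta = 1.5$ and $z_0 = 1$; these values and all function names are our own choices, not taken from the text. It integrates $w$ by the trapezoidal rule, compares with the closed form of the von Mises tail, and evaluates the ratio in (3) on the log scale to avoid underflow.

```python
import math


def w(u: float, c: float = 2.0, delta: float = 1.5) -> float:
    """Assumed scaling function w(u) = c * delta * u^(delta - 1)."""
    return c * delta * u ** (delta - 1.0)


def integrated_w(z0: float, x: float, c: float = 2.0, delta: float = 1.5,
                 steps: int = 20000) -> float:
    """Trapezoidal approximation of the integral of w over [z0, x]."""
    h = (x - z0) / steps
    total = 0.5 * (w(z0, c, delta) + w(x, c, delta))
    for i in range(1, steps):
        total += w(z0 + i * h, c, delta)
    return total * h


def von_mises_tail(x: float, z0: float = 1.0, c: float = 2.0,
                   delta: float = 1.5) -> float:
    """1 - F*(x) from (5) in closed form for the assumed w."""
    return math.exp(-c * (x ** delta - z0 ** delta))


def gumbel_ratio(u: float, s: float, c: float = 2.0, delta: float = 1.5) -> float:
    """[1 - F*(u + s/w(u))] / [1 - F*(u)], evaluated on the log scale."""
    v = u + s / w(u, c, delta)
    return math.exp(-c * (v ** delta - u ** delta))


if __name__ == "__main__":
    print(math.exp(-integrated_w(1.0, 3.0)), von_mises_tail(3.0))
    for u in (10.0, 100.0, 1000.0):
        print(u, gumbel_ratio(u, 1.0), math.exp(-1.0))
```

The printed ratios approach $\exp(-1)$ as $u$ grows, in line with (3).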

Under the Gumbel max-domain of attraction assumption on $F$ (see Hashorva (2007)) we have

(6) $P\{X > x, Y > ax\} = (1 + o(1)) \frac{c_1}{\sqrt{x w(\alpha_{\rho,x,ax} x)}} P\{X > \alpha_{\rho,x,ax} x\}, \quad x \to \infty,$

with $c_1$ a known constant, provided that $a \in (\rho, 1]$ and $x_F = \infty$, where $\alpha_{\rho,x,y}$ is defined by

(7) $\alpha_{\rho,x,y} := \sqrt{1 + ((y/x) - \rho)^2/(1 - \rho^2)}.$

For $x, y$ positive such that $y/x > a > \rho$ we have $\alpha_{\rho,x,y} > 1$. Hence, since the scaling function $w$ satisfies

(8) $\lim_{u \uparrow x_F} u w(u) = \infty, \quad \text{and} \quad \lim_{u \uparrow x_F} w(u)(x_F - u) = \infty \ \text{if } x_F < \infty,$

we conclude that the joint tail asymptotics in (6) is faster than the componentwise tail asymptotics.
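A small numerical sketch may help to see how (7) drives this conclusion. It evaluates $\alpha_{\rho,x,y}$ and, as an assumed special case, the ratio $[1 - F(\alpha_{\rho,x,y} x)]/[1 - F(x)]$ for the Gaussian model, where the radius is Rayleigh with $1 - F(u) = \exp(-u^2/2)$. The function names are ours.

```python
import math


def alpha(rho: float, x: float, y: float) -> float:
    """alpha_{rho,x,y} as defined in (7)."""
    return math.sqrt(1.0 + (y / x - rho) ** 2 / (1.0 - rho * rho))


def gaussian_tail_ratio(rho: float, x: float, y: float) -> float:
    """[1 - F(alpha x)] / [1 - F(x)] for the Rayleigh tail 1 - F(u) = exp(-u^2 / 2)."""
    a = alpha(rho, x, y)
    return math.exp(-0.5 * (a * a - 1.0) * x * x)


if __name__ == "__main__":
    rho = 0.5
    for x in (2.0, 5.0, 10.0):
        print(x, alpha(rho, x, x), gaussian_tail_ratio(rho, x, x))
```

For $y = \rho x$ the function returns exactly 1, while for $y = x$ its square equals $2/(1+\rho)$; the tail ratio then decays rapidly in $x$, which is the sense in which the joint tail is lighter than the marginal one.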
It turns out that for y < px or y close enough to px the tail asymptotics of interest is up to a constant the same as 
that of P{X > x}. For the bivariate Gaussian distribution it is well-known that this fact is closely related to the so 
called Savage condition, see Dai and Mukherjea (2001), Hashorva and Hüsler (2003), or Hashorva (2005a) for details.
For elliptical distributions the case where y is close to px has been considered independently in Gale (1980), Eddy 
and Gale (1981) and Berman (1982, 1983, 1992). 

In this paper we are interested in a refinement of (6) which will be achieved under extra costs related to a second
order assumption on F. 

Explicitly, as suggested in Abdous et al. (2007) we impose the following assumption: 
A1. Suppose that the distribution function $F$ satisfies (4) with von Mises distribution function $F^*$, and assume
further that there exist positive measurable functions $A_i, B_i, i = 1, 2$ such that for a scaling function $w$ for which (3)
holds we have

(9) $\Bigl| \frac{1 - F^*(u + x/w(u))}{1 - F^*(u)} - \exp(-x) \Bigr| \le A_1(u) B_1(x)$

and

(10) $| d(u + x/w(u)) - d(u) | \le A_2(u) B_2(x)$

for all $u$ large and any $x > 0$. Furthermore, we assume that $\lim_{u \to \infty} A_1(u) = \lim_{u \to \infty} A_2(u) = 0$, and $B_1, B_2$ are
locally bounded on any compact interval of $[0, \infty)$.

Set in the following (whenever Assumption A1 is assumed)

(11) $A(x) := A_1(x) + A_2(x), \quad B(x) := B_1(x) + \exp(-x) B_2(x), \quad \forall x > 0.$

We note in passing that $A_2(x) = B_2(x) = 0$, $x > 0$ is the original condition in the aforementioned paper, where it is
shown (see Lemma 7 therein) that the class of distribution functions satisfying Assumption A1 is quite large.

We consider in the sequel only distribution functions F with an infinite upper endpoint. Further we assume that 
$\rho \in [0, 1)$. We state now the main result of this paper:

Theorem 1. Let $(S_1, S_2)$ be a bivariate spherical random vector with associated random radius $R \sim F$, where the
distribution function $F$ has an infinite upper endpoint and $F(0) = 0$. Let $(X, Y)$ be a bivariate elliptical random
vector with stochastic representation $(X, Y) \stackrel{d}{=} (S_1, \rho S_1 + \sqrt{1 - \rho^2}\, S_2)$, $\rho \in [0, 1)$. Assume that (3) holds with the
scaling function $w$ and Assumption A1 is valid with $A_i, B_i, i = 1, 2$ positive measurable functions (with $A, B$ as in
(11)).

a) If $x, y$ are positive constants such that

(12) $y \in (\rho x, x], \quad \alpha_{\rho,x,y} > c > 1,$

and $\int_0^\infty B(s)\, ds < \infty$, then for all $x$ large

(13) $P\{X > x, Y > y\} = \frac{\alpha_{\rho,x,y} K_{\rho,x,y}}{2\pi} \frac{1 - F(\alpha_{\rho,x,y} x)}{x w(\alpha_{\rho,x,y} x)} \Bigl[ 1 + O\Bigl( A(\alpha_{\rho,x,y} x) + \frac{1}{x w(\alpha_{\rho,x,y} x)} \Bigr) \Bigr],$

with $\alpha_{\rho,x,y}$ as defined in (7) and $K_{\rho,x,y}$ given by

(14) $K_{\rho,x,y} := \frac{x^2 (1 - \rho^2)^{3/2}}{(x - \rho y)(y - \rho x)} \in (0, \infty).$



b) Set $h(x) := x w(x)$, $x > 0$, and let $z_x, x \in \mathbb{R}$ be given constants such that $|z_x| < K < \infty$ for all $x$ large. If further
$\int_0^\infty s^{-1/2} B(s)\, ds < \infty$, then for all $x$ large and $y := x[\rho + z_x \sqrt{1 - \rho^2}/\sqrt{h(x)} + \rho/h(x)]$ we have

(15) $P\{X > x, Y > y\} = \frac{1}{\sqrt{2\pi}} \frac{1 - F(x)}{\sqrt{h(x)}} [1 - \Phi(z_x)] \Bigl[ 1 + O\Bigl( A(x) + \frac{1}{h(x)} \Bigr) \Bigr],$

where $\Phi$ denotes the standard Gaussian distribution on $\mathbb{R}$.

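c) If $x, y$ are positive constants with $y \le ax$, $a \in (0, \rho)$, $\rho > 0$ and $\int_0^\infty s^{-1/2} B(s)\, ds < \infty$, then

(16) $P\{X > x, Y > y\} = \frac{1}{\sqrt{2\pi}} \frac{1 - F(x)}{\sqrt{h(x)}} \Bigl[ 1 + O\Bigl( A(x) + \frac{1}{h(x)} + \frac{1}{\sqrt{h(x)}} \frac{1 - F(\alpha_{\rho,x,y} x)}{1 - F(x)} \Bigr) \Bigr]$

is valid for all large $x$.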
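As a sanity check of the leading terms in (13) and (15), one may consider the Gaussian case, which is covered by the assumptions: the radius is then Rayleigh with $1 - F(u) = \exp(-u^2/2)$ and scaling function $w(u) = u$, so $h(x) = x^2$. The sketch below compares both leading terms with the bivariate normal tail computed by one-dimensional Simpson integration. It is only an illustration under this Gaussian assumption; the function names and integration settings are ours.

```python
import math

SQRT_2PI = math.sqrt(2.0 * math.pi)


def gaussian_joint_tail(rho: float, x: float, y: float,
                        width: float = 12.0, n: int = 20000) -> float:
    """P{X > x, Y > y} for standard normal margins with correlation rho.

    Integrates phi(s) * P{Y > y | X = s} over [x, x + width] with composite
    Simpson's rule (n even); the neglected tail beyond x + width is negligible.
    """
    sd = math.sqrt(1.0 - rho * rho)

    def integrand(s: float) -> float:
        phi = math.exp(-0.5 * s * s) / SQRT_2PI
        cond_tail = 0.5 * math.erfc((y - rho * s) / (sd * math.sqrt(2.0)))
        return phi * cond_tail

    h = width / n
    total = integrand(x) + integrand(x + width)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * integrand(x + i * h)
    return total * h / 3.0


def alpha(rho: float, x: float, y: float) -> float:
    """alpha_{rho,x,y} from (7)."""
    return math.sqrt(1.0 + (y / x - rho) ** 2 / (1.0 - rho * rho))


def k_const(rho: float, x: float, y: float) -> float:
    """K_{rho,x,y} from (14)."""
    return x * x * (1.0 - rho * rho) ** 1.5 / ((x - rho * y) * (y - rho * x))


def leading_term_13(rho: float, x: float, y: float) -> float:
    """Right-hand side of (13) without the O-term, Gaussian case (w(u) = u)."""
    a = alpha(rho, x, y)
    u = a * x
    tail = math.exp(-0.5 * u * u)          # 1 - F(alpha x)
    return a * k_const(rho, x, y) / (2.0 * math.pi) * tail / (x * u)


def y_for_15(rho: float, x: float, z: float) -> float:
    """The threshold y used in part b), with h(x) = x^2."""
    h = x * x
    return x * (rho + z * math.sqrt(1.0 - rho * rho) / math.sqrt(h) + rho / h)


def leading_term_15(rho: float, x: float, z: float) -> float:
    """Right-hand side of (15) without the O-term, Gaussian case."""
    h = x * x
    gauss_tail = 0.5 * math.erfc(z / math.sqrt(2.0))   # 1 - Phi(z)
    return math.exp(-0.5 * x * x) / (SQRT_2PI * math.sqrt(h)) * gauss_tail


if __name__ == "__main__":
    rho = 0.5
    for x in (6.0, 10.0):
        print("(13)", x, gaussian_joint_tail(rho, x, x) / leading_term_13(rho, x, x))
    for x in (8.0, 16.0):
        y = y_for_15(rho, x, 0.5)
        print("(15)", x, gaussian_joint_tail(rho, x, y) / leading_term_15(rho, x, 0.5))
```

In both cases the printed ratios move towards 1 as $x$ increases, at a speed compatible with the $1/h(x)$ correction, since $A$ plays no role for a pure von Mises tail of this kind.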

Remark 1. a) By the assumption on $F$ (recall (8)) we have $\lim_{x \to \infty} (x w(x))^{-1} = 0$. If we assume that $\lim_{u \to \infty} A(u) = 0$
holds, then we have

$O\Bigl( A(\alpha_{\rho,x,y} x) + \frac{1}{x w(\alpha_{\rho,x,y} x)} \Bigr) = o(1), \quad x \to \infty.$

Consequently by statement a) in Theorem 1

$P\{X > x, Y > y\} = (1 + o(1)) \frac{\alpha_{\rho,x,y} K_{\rho,x,y}}{2\pi} \frac{1 - F(\alpha_{\rho,x,y} x)}{x w(\alpha_{\rho,x,y} x)}, \quad x \to \infty,$

and if $y = ax(1 + o(1))$, $a \in (\rho, 1]$, then for all large $x$

$P\{X > x, Y > y\} = (1 + o(1)) \frac{\alpha_{a,\rho} K_{a,\rho}}{2\pi} \frac{1 - F(\alpha_{\rho,x,y} x)}{x w(\alpha_{\rho,x,y} x)}, \quad x \to \infty,$

holds with

(17) $\alpha_{a,\rho} := \sqrt{(1 - 2a\rho + a^2)/(1 - \rho^2)} > 1, \quad K_{a,\rho} := \frac{(1 - \rho^2)^{3/2}}{(1 - a\rho)(a - \rho)} \in (0, \infty).$



b) Sufficient conditions for (9) to hold are derived in Lemma 7 of Abdous et al. (2007). An instance is when
$\lim_{x \to \infty} w(tx)/w(x) = t^{\theta - 1}$, $\theta > 0$, $\forall t > 0$, i.e., the scaling function $w$ defining $F^*$ is regularly varying at infinity with
index $\theta - 1$. Under the assumptions of Lemma 7 in the aforementioned paper we may choose

$A_1(u) = O(|(1/w(u))'|), \quad B_1(x) = (1 + x)^{-\kappa}, \quad u > 0, x > 0, \kappa > 1.$

c) We have $\alpha^2_{\rho,x,y} = 1$ iff $y = \rho x$, and for any $y \in [\rho x, x]$, $x > 0$

$1 \le \alpha^2_{\rho,x,y} \le \frac{2}{1 + \rho}, \quad \rho \in (-1, 1).$



d) Since (3) implies that $1 - F$ is rapidly varying (see e.g., Resnick (1987) or de Haan and Ferreira (2006)) and
$\alpha_{\rho,x,y} > c > 1$, then we have

$\lim_{x \to \infty} \frac{1 - F(\alpha_{\rho,x,y} x)}{1 - F(x)} = 0,$

consequently in (16) we have

$O\Bigl( A(x) + \frac{1}{x w(x)} + \frac{1}{\sqrt{x w(x)}} \frac{1 - F(\alpha_{\rho,x,y} x)}{1 - F(x)} \Bigr) = O\Bigl( A(x) + \frac{o(1)}{\sqrt{x w(x)}} \Bigr).$



We give next an alternative expansion of the tail probability under consideration assuming $y \in (\rho x, x]$.

Theorem 2. Let $(X, Y)$, $\rho$, $F$ be as in Theorem 1. Suppose that (3) holds with the scaling function $w$ and further
Assumption A1 is satisfied with $A_i, B_i$ positive measurable functions. Let $x, y$ be two positive constants such that (12)
holds. If $\int_0^\infty \max(1, 1/\sqrt{s}) B(s)\, ds < \infty$ holds with $B$ as defined in (11), then for all $x$ large we have

(18) $P\{X > x, Y > y\} = \frac{\alpha^{3/2}_{\rho,x,y} K_{\rho,x,y}}{\sqrt{2\pi x w(\alpha_{\rho,x,y} x)}} P\{X > \alpha_{\rho,x,y} x\} \Bigl[ 1 + O\Bigl( A(\alpha_{\rho,x,y} x) + \frac{1}{x w(\alpha_{\rho,x,y} x)} \Bigr) \Bigr],$

with $\alpha_{\rho,x,y}, K_{\rho,x,y}$ as defined in (7) and (14), respectively.



Next, we consider the implications of our results for the tail asymptotics of the conditional survival probability
$P\{Y > y \mid X > x\}$.

Theorem 3. Under the assumptions and notation of Theorem 2 we have for all $x, y$ large

(19) $P\{Y > y \mid X > x\} = \frac{\alpha^{3/2}_{\rho,x,y} K_{\rho,x,y}}{\sqrt{2\pi}}\, g_1(\alpha_{\rho,x,y}, x) = \frac{\alpha^{3/2}_{\rho,x,y} K_{\rho,x,y}}{\sqrt{2\pi}}\, g_2(\alpha_{\rho,x,y}, x),$

where

(20) $g_1(\alpha_{\rho,x,y}, x) := \sqrt{\frac{w(x)}{\alpha_{\rho,x,y} x}}\, \frac{1 - F(\alpha_{\rho,x,y} x)}{w(\alpha_{\rho,x,y} x)[1 - F(x)]} \Bigl[ 1 + O\Bigl( A(x) + A(\alpha_{\rho,x,y} x) + \frac{1}{x w(x)} + \frac{1}{x w(\alpha_{\rho,x,y} x)} \Bigr) \Bigr]$

and

(21) $g_2(\alpha_{\rho,x,y}, x) := \frac{P\{X > \alpha_{\rho,x,y} x\}}{\sqrt{x w(\alpha_{\rho,x,y} x)}\, P\{X > x\}} \Bigl[ 1 + O\Bigl( A(\alpha_{\rho,x,y} x) + \frac{1}{x w(\alpha_{\rho,x,y} x)} \Bigr) \Bigr].$

Furthermore, we have

(22) $\lim_{x \to \infty} P\{Y > y \mid X > x\} = 0,$

and for any $a \in (\rho, 1]$, $z \ge 0$

(23) $P\{Y > ax + z/w(\alpha_{a,\rho} x) \mid X > x\} = \frac{\alpha^{3/2}_{a,\rho} K_{a,\rho}}{\sqrt{2\pi}} \exp(-\lambda_{a,\rho} z)\, g_1(\alpha_{a,\rho}, x)$

(24) $\qquad = \frac{\alpha^{3/2}_{a,\rho} K_{a,\rho}}{\sqrt{2\pi}} \exp(-\lambda_{a,\rho} z)\, g_2(\alpha_{a,\rho}, x)$

hold locally uniformly with respect to $z$, with $\alpha_{a,\rho}, K_{a,\rho}$ as defined in (17) and

(25) $\lambda_{a,\rho} := \frac{a - \rho}{\sqrt{(1 - \rho^2)(1 - 2a\rho + a^2)}} > 0.$

Convergence to a positive constant in (22) is achieved when $y$ depends on $x$ being close to $\rho x$ for $x$ large. See Gale
(1980), Berman (1982, 1983, 1992), Abdous et al. (2007), Hashorva (2006, 2007) for the elliptical case, and Hashorva et al.
(2007) for the more general case of Dirichlet distributions.
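The constant $\lambda_{a,\rho}$ in (25) has a simple interpretation: it is the derivative of $a \mapsto \alpha_{a,\rho}$, which is why a shift of $y$ by $z/w(\alpha_{a,\rho} x)$ shifts $\alpha_{\rho,x,y} x$ by approximately $\lambda_{a,\rho} z / w(\alpha_{a,\rho} x)$ and produces the factor $\exp(-\lambda_{a,\rho} z)$. The following sketch verifies this by a central finite difference; the function names and the step size are our own choices.

```python
import math


def alpha_a(a: float, rho: float) -> float:
    """alpha_{a,rho} from (17)."""
    return math.sqrt((1.0 - 2.0 * a * rho + a * a) / (1.0 - rho * rho))


def lam(a: float, rho: float) -> float:
    """lambda_{a,rho} from (25)."""
    return (a - rho) / math.sqrt((1.0 - rho * rho) * (1.0 - 2.0 * a * rho + a * a))


def alpha_slope(a: float, rho: float, step: float = 1e-6) -> float:
    """Central finite-difference approximation of d(alpha_{a,rho}) / da."""
    return (alpha_a(a + step, rho) - alpha_a(a - step, rho)) / (2.0 * step)


if __name__ == "__main__":
    for a, rho in ((0.8, 0.3), (1.0, 0.5), (0.6, 0.0)):
        print(a, rho, lam(a, rho), alpha_slope(a, rho))
```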

Remark 2. a) A slightly more general setup is when Assumption A1 is reformulated considering positive measurable
functions $A_i, B_i, i \ge 1$ such that for all $x > 0$ and $u$ large

$\Bigl| \frac{1 - F^*(u + x/w(u))}{1 - F^*(u)} - \exp(-x) \Bigr| \le \sum_{i=1}^{\infty} A_i(u) B_i(x) =: \psi(u, x)$

is valid with $\lim_{u \to \infty} \sup_{i \ge 1} A_i(u) = 0$, and $\psi(u, x)$ finite for all $x > 0$ and $u$ large.

b) In the asymptotic results above the scaling function $w$ appears prominently. One choice for the scaling function is
the von Mises scaling function, i.e., $w$ defines the von Mises distribution function $F^*$ in (5). We can choose however
another scaling function $\tilde w$ defined asymptotically by

(26) $\tilde w(u) := \frac{(1 + o(1))[1 - F(u)]}{\int_u^{x_F} [1 - F(s)]\, ds}, \quad u \uparrow x_F.$

Note that the bounding functions $A_i, B_i, i = 1, 2$ in the Assumption A1 depend on which concrete scaling function
we choose. In view of Lemma 16 in Abdous et al. (2007) (see its assumptions and Lemma 7 therein) if $w_F$ is the von
Mises scaling function defining $F^*$ in (5) and $\tilde w$ is another scaling function defined by (26), then (9) holds with $\tilde w$
instead of $w_F$, and $\tilde A_1$ instead of $A_1$, where $\tilde A_1$ is defined by

(27) $\tilde A_1(u) = O\bigl( |(1/w_F(u))'| + |w_F(u) - \tilde w(u)|/\tilde w(u) \bigr), \quad \forall u > 0.$

If $F$ is a von Mises distribution function, then we can make use of Lemma 7 in Abdous et al. (2007) (provided the
assumptions of that lemma hold).

We discuss in the next lemma the case F is a mixture distribution. 

Lemma 4. Let $F_i, i \ge 1$ be von Mises distribution functions with the same scaling function $w$ and upper endpoint
infinity. Suppose that the Assumption A1 is satisfied for all $F_i, i \ge 1$ with corresponding functions $w, A_{i1}, B_{i1}, i \ge 1$,
and $A_{i2}, B_{i2}, i \ge 1$ identical to 0. If $F$ is another distribution function defined by

$F(x) = \sum_{i=1}^{\infty} a_i F_i(x), \quad \text{with } a_i > 0, i \ge 1: \ \sum_{i=1}^{\infty} a_i = 1$

for all $x$ large, then $F$ is in the Gumbel max-domain of attraction. If $F$ is the distribution function of the random
radius $R$ in Theorem 2 and Theorem 3, then both these theorems hold with

$A(u) = \sum_{i=1}^{\infty} A_{i1}(u), \quad u > 0,$

provided that $\sum_{i=1}^{\infty} a_i \int_0^\infty B_{i1}(s)\, ds < \infty$ and $A(u)$ is bounded for all large $u$ with $\lim_{u \to \infty} A(u) = 0$.

3. Examples 

We present next three illustrating examples. 
Example 1. [Kotz Type I]

Let $(X, Y) \stackrel{d}{=} R(U_1, \rho U_1 + \sqrt{1 - \rho^2}\, U_2)$, $\rho \in [0, 1)$ with $R$ a positive random radius independent of the bivariate random
vector $(U_1, U_2)$ which is uniformly distributed on the unit circle of $\mathbb{R}^2$. We call $(X, Y)$ a Kotz Type I elliptical random
vector if

$1 - F(x) = C x^N \exp(-r x^\delta), \quad r > 0, C > 0, \delta > 0, N \in \mathbb{R}$

for any $x > 0$. Set $w(u) := r\delta u^{\delta - 1}$, $u > 0$. For any $x \in \mathbb{R}$ we obtain

$\lim_{u \to \infty} \frac{P\{R > u + x/w(u)\}}{P\{R > u\}} = \exp(-x).$

Consequently, $F$ is in the Gumbel max-domain of attraction with the scaling function $w$. Further, we have

$P\{X > u\} = (1 + o(1)) \frac{C}{\sqrt{2 r \delta \pi}}\, u^{N - \delta/2} \exp(-r u^\delta), \quad u \to \infty.$

The von Mises scaling function $w_F$ is given by

$w_F(u) = r\delta u^{\delta - 1} - N u^{\eta}, \quad \eta := -1.$

Hence (27) implies that the approximation under consideration holds choosing $A(u) := A_1(u)$, $u > 0$, where $A_1(u) :=
O(u^{-\delta})$, $u > 0$. Further we can take $B_1(x) = (1 + x)^{-\kappa}$, $\kappa > 1$ and $B_2(x) = 0$, $x > 0$. Hence the second order
assumption in our results above is satisfied. We may thus write for $x$ large and $y \in (\rho x, x]$ such that $\alpha_{\rho,x,y} > c > 1$

$P\{X > x, Y > y\} = (1 + o(1)) \frac{C \alpha^{2 - \delta + N}_{\rho,x,y} K_{\rho,x,y}}{2 r \delta \pi}\, x^{N - \delta} \exp(-r(\alpha_{\rho,x,y} x)^\delta) [1 + O(x^{-\delta})]$

$\qquad = (1 + o(1)) \frac{\alpha^{2 - \delta/2}_{\rho,x,y} K_{\rho,x,y}}{\sqrt{2 r \delta \pi}}\, x^{-\delta/2}\, \frac{C}{\sqrt{2 r \delta \pi}} (\alpha_{\rho,x,y} x)^{N - \delta/2} \exp(-r(\alpha_{\rho,x,y} x)^\delta) [1 + O(x^{-\delta})]$

$\qquad = (1 + o(1)) \frac{\alpha^{2 - \delta/2}_{\rho,x,y} K_{\rho,x,y}}{\sqrt{2 r \delta \pi}}\, x^{-\delta/2} P\{X > \alpha_{\rho,x,y} x\} [1 + O(x^{-\delta})]$

$\qquad = (1 + o(1)) \frac{\alpha^{2}_{\rho,x,y} K_{\rho,x,y}}{\sqrt{2 r \delta \pi}}\, P\{X^* > \alpha_{\rho,x,y} x\} [1 + O(x^{-\delta})],$

with $(X^*, Y^*)$ another Kotz Type I random vector with coefficients $C$, $N^* = N - \delta/2$, $r$, $\delta$.
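The order $u^{-\delta}$ of the second order correction can be made visible numerically. The sketch below evaluates, for a Kotz Type I tail, the deviation of $P\{R > u + x/w(u)\}/P\{R > u\}$ from $\exp(-x)$ and rescales it by $u^{\delta}$. The parameter values $N = 1$, $r = 1$, $\delta = 1.5$ are illustrative assumptions, and the function names are ours; the ratio is computed on the log scale because the tails themselves underflow for large $u$ (the constant $C$ cancels).

```python
import math


def kotz_log_ratio(u: float, x: float, N: float, r: float, delta: float) -> float:
    """log of P{R > u + x/w(u)} / P{R > u} for 1 - F(t) = C t^N exp(-r t^delta)."""
    w_u = r * delta * u ** (delta - 1.0)      # scaling function w(u)
    v = u + x / w_u
    return N * math.log(v / u) - r * (v ** delta - u ** delta)


def scaled_deviation(u: float, x: float, N: float = 1.0, r: float = 1.0,
                     delta: float = 1.5) -> float:
    """u^delta * |ratio - exp(-x)|; stays bounded if the deviation is O(u^-delta)."""
    ratio = math.exp(kotz_log_ratio(u, x, N, r, delta))
    return abs(ratio - math.exp(-x)) * u ** delta


if __name__ == "__main__":
    for u in (10.0, 50.0, 500.0, 5000.0):
        print(u, scaled_deviation(u, 1.0))
```

With these parameters the rescaled deviation stabilises near $\exp(-1) \cdot |N/(r\delta) - (\delta - 1)/(2 r \delta)| \approx 0.18$ for $x = 1$, which is what a first order Taylor expansion of the log ratio gives.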



Example 2. [Tail Equivalent Distributions]

Let $G$ be a von Mises distribution function with scaling function $w$ such that (9) holds with functions $A_1, B_1$, and let
$F_{\gamma,\tau}$, $\gamma, \tau > 0$ be another distribution function with infinite upper endpoint. Assume further that

(28) $1 - F_{\gamma,\tau}(x) = (1 + a x^{-\gamma} + O(x^{-\tau - \gamma}))[1 - G(x)], \quad \forall x > 0,$

with $\gamma, \tau$ two positive constants. Clearly, $F_{\gamma,\tau}$ and $G$ are tail equivalent since

$\lim_{x \to \infty} \frac{1 - F_{\gamma,\tau}(x)}{1 - G(x)} = 1.$

Suppose further that $w(x) := c\delta x^{\delta - 1}$, $c > 0$, $\delta > 0$, and set $d(x) := 1 + a x^{-\gamma} + O(x^{-\tau - \gamma})$, $x > 0$. We have for all
large $u$ and $x > 0$

$| d(u + x/w(u)) - d(u) | = |a| u^{-\gamma} \Bigl| \bigl(1 + x/(u w(u))\bigr)^{-\gamma} - 1 + O(u^{-\tau}) \bigl[ 1 + \bigl(1 + x/(u w(u))\bigr)^{-\tau - \gamma} \bigr] \Bigr| \le A_2(u) B_2(x),$

where $A_2(u) := O(u^{-\gamma - \min(\tau, \delta)})$, $u > 0$ and $B_2(x)$ is such that $\int_0^\infty B_2(x) \exp(-x)\, dx < \infty$. Hence our asymptotic
results in the above theorems hold for such $F_{\gamma,\tau}$ with the function $A$ defined by $A(u) := A_1(u) + A_2(u)$, $u > 0$.



Example 3. [Regularly Varying w]

Consider $F$ a von Mises distribution function in the Gumbel max-domain of attraction with the scaling function
$w_F(x) = F'(x)/[1 - F(x)]$, $x > 0$ defined by

$w_F(x) := \frac{c\delta x^{\delta - 1}}{1 + t_1(x)} = c\delta x^{\delta - 1}(1 + o(1)), \quad c > 0, \delta > 0, \forall x > 0,$

which implies (Abdous et al. (2007)) that for all $x$ large

$P\{R > x\} = \exp(-c x^{\delta}(1 + t_2(x)))$

holds, where $t_i(x), i = 1, 2$ are two regularly varying functions with index $\eta\delta$, $\eta < 0$.
Now choosing $w(x) := c\delta x^{\delta - 1}$ instead of $w_F(x)$ we conclude that the second order correction function $A_1$ satisfies

$A_1(u) := O(u^{-\delta} + u^{\eta\delta} L_1(u)), \quad u > 0,$

with $L_1(u)$ a positive slowly varying function, i.e., $\lim_{u \to \infty} L_1(tu)/L_1(u) = 1$, $t > 0$. It can be easily checked that our
main theorems above hold for this case with $A(u) := A_1(u)$, $u > 0$.



4. Estimation of Joint Survival Probability

Consider the estimation of the joint survival probability $P\{X > x, Y > y\}$ for $x, y$ large, with $(X, Y)$ a bivariate
random vector satisfying the assumptions of Theorem 2. We discuss first the implications of (13), (18), and then
outline the estimation motivated by (15).

Estimation based on (13), (18)

Consider the case $x = y$ is large. The constants $\alpha_{\rho,x,x}, K_{\rho,x,x}$ do not depend on $x$ and $y$ (they are given in (17) for $a = 1$, $\rho \in
[0, 1)$). Both asymptotic expansions can be written as

$P\{X > x, Y > x\} = q_1(x, \rho, w, F) = q_2(x, \rho, w, G_\zeta),$

with $F$ and $G_\zeta$ the distribution functions of $R$ and $Z_{\zeta,\rho}$, respectively.

If $(X_i, Y_i), 1 \le i \le n$ are independent copies of $(X, Y)$, then a $\sqrt{n}$-consistent estimator of $\rho$ is available from the
literature. The more difficult part is the estimation of the tails and the function $w$.

If we restrict ourselves to distribution functions $F$ in the Gumbel max-domain of attraction with von Mises scaling
function

$w(x) = c\delta x^{\delta - 1}, \quad c > 0, \delta > 0, x > 0,$

then $w$ can be estimated utilising the techniques in Abdous et al. (2007), where estimators for $c$ and $\delta$ are constructed
using previous results of Girard (2004), Gardes and Girard (2006). See also the recent contribution Diebolt et al.
(2007).

The estimation of the tail $1 - F$ can be performed for instance if we restrict ourselves to the case of Example 3
above. Advanced extreme value statistics provide estimation of Gumbel tails and related quantiles under second order
assumptions. See the recent deep monographs de Haan and Ferreira (2006), Reiss and Thomas (2007). If we use (18)
then a second order asymptotic condition on the distribution of $X$ needs to be imposed.

Note in passing that the scaling function $w$ appears in the assumption on $F$, which in turn implies that both
$X$ and $Y$ have distribution functions in the Gumbel max-domain of attraction with the same scaling function $w$.
Consequently we may estimate $w$ alternatively utilising only the observations $X_i, 1 \le i \le n$, or more generally we
may estimate $w$ from the observations

$Z_{\zeta,\rho,i} := \frac{\zeta X_i + Y_i}{\sqrt{\zeta^2 + 2\zeta\rho + 1}} \stackrel{d}{=} X, \quad \zeta \in \mathbb{R}, \ i = 1, \ldots, n.$

Further, instead of estimating the tail $1 - F$ we may estimate the tail $1 - G_\zeta$. If we use the random points $X_i, 1 \le i \le n$
to estimate $w$ the advantage is that the estimator of $\rho$ is not involved. The disadvantage is that the second order
correction is a consequence of an assumption on $R$ and not on $X$.

Estimation based on (15)

If $x, y$ are large positive constants then making use of the approximation in (15) we may write (set $h(x) := x w(x)$, $x >
0$)

$P\{X > x, Y > y\} \approx \Bigl[ 1 - \Phi\Bigl( [y/x - \rho - \rho/h(x)] \sqrt{h(x)} / \sqrt{1 - \rho^2} \Bigr) \Bigr] \frac{1}{\sqrt{2\pi}} \frac{1 - F(x)}{\sqrt{h(x)}},$

hence an estimator of the probability of interest can be constructed from the right hand side of the above approximation.
Again we have the same estimation issues for the tail $1 - F$ and $w(x)$ where $x$ is large as above.

5. Estimation of Conditional Survival and Quantile Function 

Let $(X, Y), (X_i, Y_i), 1 \le i \le n$, be as in the previous section. Our asymptotic results above can be employed for the
estimation of the conditional excess survival function

$\Psi(y, x) := P\{Y > y \mid X > x\},$

where $x, y$ are large. The estimation of the conditional distribution $1 - \Psi(y, x)$ is discussed in Abdous et al. (2007).
Clearly, one way to address this problem is to consider the estimation of the joint and marginal survival probabilities
$P\{X > x, Y > y\}$, $P\{X > x\}$, separately. As noted in the aforementioned paper for large values of $x$ the empirical
distribution function is useless since no observations might fall in the relevant regions.

Our suggestion for the estimation of $\Psi(y, x)$ is motivated by the asymptotic relations shown in Theorem 3. Under
the assumptions of that theorem for all large $x$ and any $z \in \mathbb{R}$ we have

$\Psi(x + z/w(\alpha_{1,\rho} x), x) = \frac{\alpha^{3/2}_{1,\rho} K_{1,\rho}}{\sqrt{2\pi}} \exp(-\lambda_{1,\rho} z)\, g_1(\alpha_{1,\rho}, x)$

$\qquad = \frac{\alpha^{3/2}_{1,\rho} K_{1,\rho}}{\sqrt{2\pi}} \exp(-\lambda_{1,\rho} z)\, g_2(\alpha_{1,\rho}, x),$

with $\alpha_{1,\rho}, K_{1,\rho}$ and $\lambda_{1,\rho}$ as in (17) and (25), respectively.

Let $w_n, g_{n1}, g_{n2}, \rho_n$ be estimators of the functions $w, g_1, g_2$ and $\rho$, respectively, and set

$\alpha_n := \alpha_{1,\rho_n}, \quad \lambda_n := \lambda_{1,\rho_n}, \quad n \ge 1.$

Then we may estimate $\Psi(y, x)$ by

$\Psi_{n,1}(y, x) := \frac{\alpha_n^{3/2} K_{1,\rho_n}}{\sqrt{2\pi}} \exp(-\lambda_n w_n(\alpha_n x)(y - x))\, g_{n1}(\alpha_n, x)$

(29) $\qquad =: \exp(-\lambda_n w_n(\alpha_n x) y)\, g^*_{n1}(\alpha_n, x),$

or alternatively

$\Psi_{n,2}(y, x) := \frac{\alpha_n^{3/2} K_{1,\rho_n}}{\sqrt{2\pi}} \exp(-\lambda_n w_n(\alpha_n x)(y - x))\, g_{n2}(\alpha_n, x)$

(30) $\qquad =: \exp(-\lambda_n w_n(\alpha_n x) y)\, g^*_{n2}(\alpha_n, x).$

Let $y(q, x)$, $q \in (0, 1)$, $x > 0$ be the quantile function defined as the inverse function of $1 - \Psi(y, x)$ for $x > 0$ fixed.
Inverting the above expressions we have also two estimators of the conditional quantile function

(31) $y_{n,1}(q, x) := \frac{\ln(g^*_{n1}(\alpha_n, x)) - \ln(1 - q)}{\lambda_n w_n(\alpha_n x)}$

and

(32) $y_{n,2}(q, x) := \frac{\ln(g^*_{n2}(\alpha_n, x)) - \ln(1 - q)}{\lambda_n w_n(\alpha_n x)}.$

The difficulty in constructing these estimates lies in the fact that we have to estimate both the tail $1 - F$ (implicit in
the estimation of the functions $g_1, g_2$), and the scaling function $w$. In the setup of Example 3 above there is a simple
relation between these functions, and the second order correction is easy to handle.
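The inversion leading to (31) and (32) is elementary, and a short sketch shows the round trip. The numerical inputs below stand for hypothetical plug-in values of $\lambda_n$, $w_n(\alpha_n x)$ and $g^*_{n1}(\alpha_n, x)$; they are not estimates from data, and the function names are ours.

```python
import math


def psi_hat(y: float, lam_n: float, w_n: float, g_star: float) -> float:
    """Estimated conditional survival exp(-lam_n * w_n * y) * g_star, as in (29)."""
    return math.exp(-lam_n * w_n * y) * g_star


def quantile_hat(q: float, lam_n: float, w_n: float, g_star: float) -> float:
    """Conditional quantile from (31): solves 1 - psi_hat(y) = q for y."""
    return (math.log(g_star) - math.log(1.0 - q)) / (lam_n * w_n)


if __name__ == "__main__":
    lam_n, w_n, g_star = 0.7, 2.0, 50.0   # hypothetical plug-in values
    for q in (0.5, 0.9, 0.99):
        y = quantile_hat(q, lam_n, w_n, g_star)
        print(q, y, 1.0 - psi_hat(y, lam_n, w_n, g_star))
```

Note that $g^*_{n1}$ absorbs the factor $\exp(\lambda_n w_n(\alpha_n x) x)$ and may therefore be much larger than 1; the formula is only meaningful for levels $q$ at which the resulting quantile lies in the range where the approximation is intended to hold.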



6. Proofs and Related Asymptotics 

In the following lemma we derive a formula for the distribution function of a bivariate elliptical random vector. Define
next for $a \ge 1$ and $x, y$ positive constants

(33) $I(a, x) := \int_a^\infty [1 - F(xs)] \frac{1}{s\sqrt{s^2 - 1}}\, ds,$

and

(34) $\alpha_{\rho,x,y} := \sqrt{1 + ((y/x) - \rho)^2/(1 - \rho^2)} \ge 1, \quad \beta_{\rho,x,y} := \alpha_{\rho,x,y}\, x/y, \quad x, y \in \mathbb{R}, \rho \in (-1, 1).$

Lemma 5. Let $(S_1, S_2)$ be a bivariate spherical random vector with associated random radius $R$ which has distribution
function $F$. If $\rho \in [0, 1)$ and $F(0) = 0$ then we have:

a) If $x > 0$ and $y \in (\rho x, x]$

(35) $P\{S_1 > x, \rho S_1 + \sqrt{1 - \rho^2}\, S_2 > y\} = \frac{1}{2\pi} [I(\alpha_{\rho,x,y}, x) + I(\beta_{\rho,x,y}, y)].$

b) If $y/x < \rho$ and $x > 0, y > 0$

(36) $P\{S_1 > x, \rho S_1 + \sqrt{1 - \rho^2}\, S_2 > y\} = \frac{1}{2\pi} [2I(1, x) - I(\alpha_{\rho,x,y}, x) + I(\beta_{\rho,x,y}, y)].$

Proof. We have the stochastic representation (see Cambanis et al. (1981), Berman (1992))

$(S_1, \rho S_1 + \sqrt{1 - \rho^2}\, S_2) \stackrel{d}{=} (S_2, \rho S_2 + \sqrt{1 - \rho^2}\, S_1) \stackrel{d}{=} (R\cos(\Theta), R\cos(\Theta - \psi)),$

with $\Theta$ uniformly distributed in $(-\pi, \pi)$ independent of $R$ and $\psi := \arccos(\rho)$. For $x > 0, y > 0$ two constants we
may thus write (see Lemma 3.3 in Hashorva (2005b))

$2\pi P\{R\cos(\Theta) > x, R\cos(\Theta - \psi) > y\}$

$\qquad = \int_{\arctan(((y/x) - \rho)/\sqrt{1 - \rho^2})}^{\pi/2} P\{R > x/\cos(\theta)\}\, d\theta + \int_{\arccos(\rho) - \pi/2}^{\arctan(((y/x) - \rho)/\sqrt{1 - \rho^2})} P\{R > y/\cos(\theta - \psi)\}\, d\theta$

$\qquad = \int_{\arctan(((y/x) - \rho)/\sqrt{1 - \rho^2})}^{\pi/2} P\{R > x/\cos(\theta)\}\, d\theta + \int_{-\arctan(((y/x) - \rho)/\sqrt{1 - \rho^2})}^{\pi/2 - \psi} P\{R > y/\cos(\theta + \psi)\}\, d\theta$

$\qquad = \int_{\arctan(((y/x) - \rho)/\sqrt{1 - \rho^2})}^{\pi/2} P\{R > x/\cos(\theta)\}\, d\theta + I(\beta_{\rho,x,y}, y).$

As in Abdous et al. (2007) we obtain for $y/x \ge \rho$

$2\pi P\{R\cos(\Theta) > x, R\cos(\Theta - \psi) > y\} = I(\alpha_{\rho,x,y}, x) + I(\beta_{\rho,x,y}, y),$

and if $y/x < \rho$ with $x, y$ positive

$2\pi P\{R\cos(\Theta) > x, R\cos(\Theta - \psi) > y\} = 2I(1, x) - I(\alpha_{\rho,x,y}, x) + I(\beta_{\rho,x,y}, y),$

with $\alpha_{\rho,x,y}, \beta_{\rho,x,y}$ as defined in (34), hence the proof is complete. □

Note in passing that

$(R\cos(\Theta), R\cos(\Theta - \arccos(\rho))) = (R\cos(\Theta), R\sin(\Theta + \arcsin(\rho))),$

which leads to the alternative formula derived in Abdous et al. (2007) and Klüppelberg et al. (2007) for the tails
of elliptical distributions. Remark further that some alternative formulae for the distribution of bivariate elliptical
random vectors are presented in Lemma 3.3 in Hashorva (2005b).
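Formula (35) is an exact identity and can therefore be tested numerically. The sketch below does so in the Gaussian case, assuming a Rayleigh radius with $1 - F(u) = \exp(-u^2/2)$, for which $(X, Y)$ is standard bivariate normal with correlation $\rho$. The integral $I(a, x)$ is evaluated after the substitution $s = 1/\cos(\theta)$, which removes the factor $1/(s\sqrt{s^2 - 1})$. The function names and the Simpson settings are our own choices.

```python
import math


def simpson(f, lo: float, hi: float, n: int) -> float:
    """Composite Simpson's rule on [lo, hi] with an even number n of subintervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(lo + i * h)
    return total * h / 3.0


def rayleigh_tail(u: float) -> float:
    """1 - F(u) for the radius of a standard bivariate normal vector."""
    return math.exp(-0.5 * u * u)


def I(a: float, x: float, n: int = 4000) -> float:
    """I(a, x) from (33), written as an integral over theta in [arccos(1/a), pi/2]."""
    return simpson(lambda th: rayleigh_tail(x / math.cos(th)),
                   math.acos(1.0 / a), 0.5 * math.pi, n)


def formula_35(rho: float, x: float, y: float) -> float:
    """Right-hand side of (35), valid for y in (rho * x, x]."""
    a = math.sqrt(1.0 + (y / x - rho) ** 2 / (1.0 - rho * rho))
    b = a * x / y
    return (I(a, x) + I(b, y)) / (2.0 * math.pi)


def gaussian_joint_tail(rho: float, x: float, y: float) -> float:
    """P{X > x, Y > y} for the bivariate normal, by conditioning on X = s."""
    sd = math.sqrt(1.0 - rho * rho)

    def integrand(s: float) -> float:
        phi = math.exp(-0.5 * s * s) / math.sqrt(2.0 * math.pi)
        return phi * 0.5 * math.erfc((y - rho * s) / (sd * math.sqrt(2.0)))

    return simpson(integrand, x, x + 12.0, 20000)


if __name__ == "__main__":
    print(formula_35(0.3, 2.0, 1.5), gaussian_joint_tail(0.3, 2.0, 1.5))
```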

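In the next lemma we consider a real function $a(x) > 1$, $\forall x \in \mathbb{R}$. We write for notational simplicity $a$ instead of $a(x)$.

Lemma 6. Let $F$ satisfy (3) with the scaling function $w$ and $x_F = \infty$, $F(0) = 0$. Assume further that the Assumption
A1 holds.

i) If $\int_0^\infty B(s)\, ds < \infty$, then for any function $a := a(x) > 1$, $x \in \mathbb{R}$ and $x$ large we have

(37) $\int_a^\infty [1 - F(xs)] \frac{1}{s\sqrt{s^2 - 1}}\, ds = \frac{1 - F(ax)}{a\sqrt{a^2 - 1}\, x w(ax)} \Bigl[ 1 + O\Bigl( A(ax) + \frac{1}{x w(ax)} \Bigr) \Bigr].$

ii) If $z_x, x \in \mathbb{R}$ is such that for all $x$ large $0 \le z_x < K < \infty$ and further $\int_0^\infty B(s)/\sqrt{s}\, ds < \infty$, then for all large $x$ we
have

(38) $\int_{1 + z_x/(x w(x))}^\infty [1 - F(xs)] \frac{1}{s\sqrt{s^2 - 1}}\, ds = \frac{1 - F(x)}{\sqrt{x w(x)}} \sqrt{2\pi}\, [1 - \Phi(\sqrt{2 z_x})] \Bigl[ 1 + O\Bigl( A(x) + \frac{1}{x w(x)} \Bigr) \Bigr].$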



Proof. i) Let $x$ be a given positive constant. Set

$a := a(x), \quad v_a(s) := s w(as), \quad s > 0$

and define $I(a, x)$, $\alpha_{\rho,x,y}, \beta_{\rho,x,y}$ as in (33) and (34), respectively. Transforming the variables we have

$I(a, x) = \frac{1 - F(ax)}{v_a(x)} \int_0^\infty \frac{1 - F(ax + s/w(ax))}{1 - F(ax)} \frac{1}{(a + s/v_a(x))\sqrt{(a + s/v_a(x))^2 - 1}}\, ds.$

Further (9) implies for any $s > 0$ and all $x$ large

$\Bigl| \frac{1 - F(ax + s/w(ax))}{1 - F(ax)} - \exp(-s) \Bigr| \le A(ax) B(s).$

Hence utilising further the Assumption A1 we may write for all $x$ large

$\Bigl| \frac{v_a(x)}{1 - F(ax)} \int_a^\infty [1 - F(xs)] \frac{1}{s\sqrt{s^2 - 1}}\, ds - \int_0^\infty \frac{\exp(-s)}{a\sqrt{a^2 - 1}}\, ds \Bigr|$

$\qquad \le \int_0^\infty \Bigl| \frac{1 - F(ax + s/w(ax))}{1 - F(ax)} - \exp(-s) \Bigr| \frac{1}{(a + s/v_a(x))\sqrt{(a + s/v_a(x))^2 - 1}}\, ds$

$\qquad\quad + \int_0^\infty \exp(-s) \Bigl| \frac{1}{(a + s/v_a(x))\sqrt{(a + s/v_a(x))^2 - 1}} - \frac{1}{a\sqrt{a^2 - 1}} \Bigr|\, ds$

$\qquad \le \frac{1}{a\sqrt{a^2 - 1}} \Bigl[ A(ax) \int_0^\infty B(s)\, ds + O\Bigl( \frac{1}{v_a(x)} \Bigr) \Bigr],$

thus the first claim follows.



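ii) Next we consider the second case $a(x) := 1 + z_x/(x w(x))$, $x > 0$. Set $h(x) := x w(x)$, $x > 0$. Transforming the
variables we obtain for all $x$ positive

$I(1 + z_x/h(x), x) = \frac{1 - F(x)}{\sqrt{h(x)}} \int_{z_x}^\infty \frac{1 - F(x + s/w(x))}{1 - F(x)} \frac{1}{\sqrt{h(x)}\,(1 + s/h(x))\sqrt{(1 + s/h(x))^2 - 1}}\, ds.$

Hence since $\int_0^\infty s^{-1/2} B(s)\, ds < \infty$ for all $x$ large we have (recall Assumption A1)

$\Bigl| \int_{z_x}^\infty \frac{1 - F(x + s/w(x))}{1 - F(x)} \frac{1}{\sqrt{h(x)}\,(1 + s/h(x))\sqrt{(1 + s/h(x))^2 - 1}}\, ds - \int_{z_x}^\infty \frac{\exp(-s)}{\sqrt{2s}}\, ds \Bigr|$

$\qquad \le \int_{z_x}^\infty \Bigl| \frac{1 - F(x + s/w(x))}{1 - F(x)} - \exp(-s) \Bigr| \frac{1}{\sqrt{h(x)}\,(1 + s/h(x))\sqrt{(1 + s/h(x))^2 - 1}}\, ds$

$\qquad\quad + \int_{z_x}^\infty \exp(-s) \Bigl| \frac{1}{\sqrt{h(x)}\,(1 + s/h(x))\sqrt{(1 + s/h(x))^2 - 1}} - \frac{1}{\sqrt{2s}} \Bigr|\, ds$

$\qquad \le A(x) \int_0^\infty B(s) \frac{1}{\sqrt{2s}}\, ds + O\Bigl( \frac{1}{h(x)} \Bigr) \int_0^\infty \exp(-s) \sqrt{s}\, ds$

$\qquad = O\Bigl( A(x) + \frac{1}{h(x)} \Bigr).$

Consequently since $z_x$ is bounded for all $x$ large

$I(1 + z_x/h(x), x) = \frac{1 - F(x)}{\sqrt{h(x)}} \int_{z_x}^\infty \exp(-s) \frac{1}{\sqrt{2s}}\, ds \Bigl[ 1 + O\Bigl( A(x) + \frac{1}{h(x)} \Bigr) \Bigr]$

$\qquad = \frac{1 - F(x)}{\sqrt{h(x)}} \sqrt{2\pi}\, [1 - \Phi(\sqrt{2 z_x})] \Bigl[ 1 + O\Bigl( A(x) + \frac{1}{h(x)} \Bigr) \Bigr]$

is valid with $\Phi$ the standard Gaussian distribution function on $\mathbb{R}$. Thus the proof is complete. □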
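The last step of the proof rests on the identity $\int_z^\infty \exp(-s)/\sqrt{2s}\, ds = \sqrt{2\pi}\,[1 - \Phi(\sqrt{2z})]$, which follows from the substitution $s = v^2/2$. A brief numerical confirmation, with function names and integration settings of our own choosing, reads as follows.

```python
import math


def tail_integral(z: float, width: float = 40.0, n: int = 40000) -> float:
    """Simpson approximation of the integral of exp(-s) / sqrt(2 s) over [z, z + width], z > 0."""
    def f(s: float) -> float:
        return math.exp(-s) / math.sqrt(2.0 * s)

    h = width / n
    total = f(z) + f(z + width)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(z + i * h)
    return total * h / 3.0


def closed_form(z: float) -> float:
    """sqrt(2 pi) * [1 - Phi(sqrt(2 z))], using 1 - Phi(t) = erfc(t / sqrt(2)) / 2."""
    return math.sqrt(2.0 * math.pi) * 0.5 * math.erfc(math.sqrt(z))


if __name__ == "__main__":
    for z in (0.25, 0.5, 2.0):
        print(z, tail_integral(z), closed_form(z))
```

For $z = 0$ the closed form equals $\sqrt{\pi/2}$, which together with $P\{X > x\} = I(1, x)/\pi$ gives the leading term $[1 - F(x)]/\sqrt{2\pi h(x)}$ of the marginal tail.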



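Proof of Theorem 1. Define $I(a, x)$ and $\alpha_{\rho,x,y}, \beta_{\rho,x,y}$ as in (33) and (34), respectively. Assumption A1 implies
that for any $x > 0$, $\varepsilon > 0$ and $u$ large we have

$\Bigl| \frac{1 - F(u + x/w(u))}{1 - F(u)} - \exp(-x) \Bigr|$

$\qquad \le \frac{d(u + x/w(u))}{d(u)} \Bigl| \frac{1 - F^*(u + x/w(u))}{1 - F^*(u)} - \exp(-x) \Bigr| + \exp(-x) \Bigl| \frac{d(u + x/w(u))}{d(u)} - 1 \Bigr|$

$\qquad \le (1 + \varepsilon) A_1(u) B_1(x) + \exp(-x) A_2(u) B_2(x),$

with $F^*$ the von Mises distribution function given in (5). We assume for simplicity in the following that the function
$d(\cdot)$ is a constant for all $x$ large, implying $A_2(u) = 0$ for all $u$ large.

a) Let $x, y$ be two positive constants such that $y > \rho x$ and $\alpha_{\rho,x,y} > c > 1$. In order to complete the proof we need a
formula for the survival probability $P\{X > x, Y > y\}$. In view of (35) we have for $y > \rho x$ and $x, y$ positive

$2\pi P\{X > x, Y > y\} = I(\alpha_{\rho,x,y}, x) + I(\beta_{\rho,x,y}, y).$

Since further $\alpha_{\rho,x,y} > c > 1$, applying Lemma 6 (recall $\beta_{\rho,x,y}\, y = \alpha_{\rho,x,y}\, x$) we obtain

$2\pi P\{X > x, Y > y\} = \frac{1}{\alpha_{\rho,x,y}\sqrt{\alpha^2_{\rho,x,y} - 1}} \frac{1 - F(\alpha_{\rho,x,y} x)}{x w(\alpha_{\rho,x,y} x)} \Bigl[ 1 + O\Bigl( A(\alpha_{\rho,x,y} x) + \frac{1}{x w(\alpha_{\rho,x,y} x)} \Bigr) \Bigr]$

$\qquad\quad + \frac{1}{\alpha_{\rho,x,y}(x/y)\sqrt{\alpha^2_{\rho,x,y}(x/y)^2 - 1}} \frac{1 - F(\alpha_{\rho,x,y} x)}{y w(\alpha_{\rho,x,y} x)} \Bigl[ 1 + O\Bigl( A(\alpha_{\rho,x,y} x) + \frac{1}{x w(\alpha_{\rho,x,y} x)} \Bigr) \Bigr]$

$\qquad = \frac{1 - F(\alpha_{\rho,x,y} x)}{x w(\alpha_{\rho,x,y} x)} \Bigl[ \frac{1}{\alpha_{\rho,x,y}\sqrt{\alpha^2_{\rho,x,y} - 1}} + \frac{1}{\alpha_{\rho,x,y}\sqrt{\alpha^2_{\rho,x,y}(x/y)^2 - 1}} \Bigr] \Bigl[ 1 + O\Bigl( A(\alpha_{\rho,x,y} x) + \frac{1}{x w(\alpha_{\rho,x,y} x)} \Bigr) \Bigr]$

$\qquad = \frac{1 - F(\alpha_{\rho,x,y} x)}{x w(\alpha_{\rho,x,y} x)} \frac{x^2 (1 - \rho^2)^{3/2} \alpha_{\rho,x,y}}{(y - \rho x)(x - \rho y)} \Bigl[ 1 + O\Bigl( A(\alpha_{\rho,x,y} x) + \frac{1}{x w(\alpha_{\rho,x,y} x)} \Bigr) \Bigr].$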



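b) Let $z_x, x \in \mathbb{R}$ be constants bounded for all $x$. We may write for all $x, y$

$P\{X > x, Y > y\} = P\{X > x\} \frac{P\{X > x, Y > y\}}{P\{X > x\}} =: P\{X > x\}\, \chi(x, y).$

For any $x$ positive

$P\{X > x\} = \frac{1}{\pi} I(1, x),$

hence Lemma 6 implies for all $x$ large enough

$P\{X > x\} = \frac{1}{\sqrt{2\pi}} \frac{1 - F(x)}{\sqrt{h(x)}} \Bigl[ 1 + O\Bigl( A(x) + \frac{1}{h(x)} \Bigr) \Bigr],$

with $h(x) := x w(x)$, $x > 0$. In view of Theorem 3 in Abdous et al. (2007) we have for all large $x$

$\chi(x, y) = [1 - \Phi(z_x)] \Bigl[ 1 + O\Bigl( A(x) + \frac{1}{h(x)} \Bigr) \Bigr].$

Hence for all large $x$

$P\{X > x, Y > y\} = \frac{1}{\sqrt{2\pi}} \frac{1 - F(x)}{\sqrt{h(x)}} [1 - \Phi(z_x)] \Bigl[ 1 + O\Bigl( A(x) + \frac{1}{h(x)} \Bigr) \Bigr],$

thus the result follows.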

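c) Now we consider the last case. By (36) for all $x, y$ large with $y \le ax < \rho x$ (implying $y \in (0, x]$)

$2\pi P\{X > x, Y > y\} = 2I(1, x) - I(\alpha_{\rho,x,y}, x) + I(\beta_{\rho,x,y}, y),$

where $\alpha_{\rho,x,y} \ge \sqrt{1 + (a - \rho)^2/(1 - \rho^2)} > 1$ and $\beta_{\rho,x,y} := \alpha_{\rho,x,y}\, x/y > 1$. By the above results we have for all $y \le ax$
and $x, y$ large enough

$P\{X > x, Y > y\} = \frac{1}{\sqrt{2\pi}} \frac{1 - F(x)}{\sqrt{h(x)}} \Bigl[ 1 + O\Bigl( A(x) + \frac{1}{h(x)} + \frac{1}{\sqrt{h(x)}} \frac{1 - F(\alpha_{\rho,x,y} x)}{1 - F(x)} \Bigr) \Bigr],$

thus the proof is complete. □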

Proof of Theorem 2. From the proof of Theorem 1 we obtain for all large $x$

$P\{X > x\} = \frac{1 - F(x)}{\sqrt{2\pi x w(x)}} \Bigl[ 1 + O\Bigl( A(x) + \frac{1}{x w(x)} \Bigr) \Bigr].$

Since further $\alpha_{\rho,x,y} > 1$ and it is bounded for all $x$ large and $y$ positive such that (12) holds, we may write for all $x$
large

(39) $P\{X > \alpha_{\rho,x,y} x\} = \frac{1 - F(\alpha_{\rho,x,y} x)}{\sqrt{2\pi \alpha_{\rho,x,y} x w(\alpha_{\rho,x,y} x)}} \Bigl[ 1 + O\Bigl( A(\alpha_{\rho,x,y} x) + \frac{1}{x w(\alpha_{\rho,x,y} x)} \Bigr) \Bigr].$

Applying Theorem 1 we have

$P\{X > x, Y > y\} = \frac{\alpha^{3/2}_{\rho,x,y} K_{\rho,x,y}}{\sqrt{2\pi x w(\alpha_{\rho,x,y} x)}} \frac{1 - F(\alpha_{\rho,x,y} x)}{\sqrt{2\pi \alpha_{\rho,x,y} x w(\alpha_{\rho,x,y} x)}} \Bigl[ 1 + O\Bigl( A(\alpha_{\rho,x,y} x) + \frac{1}{x w(\alpha_{\rho,x,y} x)} \Bigr) \Bigr]$

$\qquad = \frac{\alpha^{3/2}_{\rho,x,y} K_{\rho,x,y}}{\sqrt{2\pi x w(\alpha_{\rho,x,y} x)}} P\{X > \alpha_{\rho,x,y} x\} \Bigl[ 1 + O\Bigl( A(\alpha_{\rho,x,y} x) + \frac{1}{x w(\alpha_{\rho,x,y} x)} \Bigr) \Bigr].$

Thus the result follows. □

Proof of Theorem 3. The proof of the first claim follows easily since

$P\{Y > y \mid X > x\} = P\{X > x, Y > y\}/P\{X > x\}, \quad \forall x > 0$

utilising further the results of Theorem 1 and (39). The fact that the distribution function $F$ is in the Gumbel
max-domain of attraction with scaling function $w$ implies that also the random variable $X$ has distribution function
in the Gumbel max-domain of attraction (Berman (1992), Hashorva (2005b)) with the same scaling function $w$. If $x, y$
are positive constants such that $\alpha_{\rho,x,y} > c > 1$ holds, then

$\lim_{x \to \infty} \frac{P\{X > \alpha_{\rho,x,y} x\}}{P\{X > x\}} = 0,$

hence the result follows. □

Proof of Lemma 4. Since $F_i, i \ge 1$ are in the Gumbel max-domain of attraction with the same scaling function $w$
it follows easily that the distribution function $F$ is in the Gumbel max-domain of attraction with the same scaling
function $w$. It can be easily checked that both Theorem 2 and Theorem 3 hold with $A(u) := \sum_{i=1}^{\infty} A_{i1}(u)$, $u > 0$,
hence the result follows. □